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Staggered fermions and the fourth root trick 



1. Introduction and overview 

Staggered fermions [[I]] are commonly used for numerical computations in lattice QCD, in 
which it is now becoming standard to include the fermion determinants for all three light quark 
flavors, up, down, and strange. They have an exact chiral £7(1) symmetry at zero quark mass [0], 
implying that the critical value of the bare quark mass for each quark flavor is known to be zero. 
Furthermore, staggered fermions are numerically inexpensive, and the combination of these two 
facts has made it possible to reach the chiral regime of light pseudo-scalar Goldstone boson (GB) 
masses essential for phenomenology. 

In practice, a separate staggered fermion field is introduced for each physical flavor, with a 
single-site mass term to describe their masses m u , and m s . However, each staggered fermion 
describes four degenerate quarks in the continuum limit, commonly referred to as "tastes," and one 
ends up with a theory with four up, four down, and four strange quarks, with (for non-degenerate 
quark masses) a U (4)„ x U (4)j x U (4). s vector-like symmetry. In order to remedy this problem, 
one starts from the observation that if indeed each staggered fermion field describes four degenerate 
tastes in the limit of vanishing lattice spacing a, one would expect that the determinant of the 
staggered Dirac operator D stag factorizes as 

Det(D ifflg (m))~Det 4 (D 

continuum 

(m)), a^O, (1.1) 

where m is the quark mass. The idea is then to take the fourth root, Det l ^ 4 (D stag ) for each flavor 
in the generation of the gauge-field ensemble on which observables are computed. At the diagram- 
matic level, this multiplies each sea-quark loop by 1/4, thus correcting for the too-many tastes per 
quark flavor present in the theory without the fourth root. Since the staggered determinant is posi- 
tive for any and the continuum determinant is (formally) positive for any m > 0, one picks 
the positive fourth root. 1 Since the continuum determinant is only positive for m > 0, the trick only 
works for positive physical quark mass (see, however, Refs. [^, ||]). 



Obviously, the argument for this trick is heuristic. Factorization as in Eq. (1.1) can never be 
exact: first, the right-hand side is not well-defined, and second, the ultraviolet (UV) eigenvalues of 
Dstag will never form taste quartets. But we expect that quartets of eigenvalues will form at physical 
scales for small enough lattice spacing, while UV effects can be absorbed into a renormalization 
of the gauge coupling and masses, and thus that factorization will effectively take place in the 
continuum limit, validating the use of the fourth-root trick. We note that this expectation is based 
on the generally-assumed properties of the continuum limit of unrooted staggered fermions (to 



which Eq. ( |1.1| ) refers), about which there is little doubt. This observation plays a key role in 
establishing the validity of regulating QCD with rooted staggered fermions [Q]. 

We thus face the following set of questions, all of which need to be answered to establish the 
validity of working with rooted staggered fermions: 

1. The prescription for regulating QCD with staggered fermions and the fourth-root trick is 
unambiguous and easy to implement. The question is whether this prescription is a regulator 
like any other or not. We will argue in Sec. 2 that it is not, in the sense that the theory is 
non-local at a ^ [Sj. 



'A negative staggered quark mass can be made positive through a U(l) £ rotation. 
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2. Given this result, the next question is whether the continuum limit can be taken, and whether 
that limit is in the correct universality class. In Sec. 3 we discuss a renormalization-group 
(RG) framework through which to address this question To motivate the use of the RG, 
we note that it deals with two issues already mentioned: it separates UV scales from physical 
scales, and it makes it possible to define a U (4) taste-invariant theory at some fixed coarse 
lattice spacing a c -C Ag^, obtained by blocking the original staggered theory on a fine 
lattice with spacing a = cif an infinite number of times (so that a.f/a c — ► 0). In other words, 
it helps in defining the right-hand side of Eq. ( |1.1[ ). Our arguments give strong evidence that 
indeed the non-locality vanishes in the continuum limit and the correct universality class is 
obtained using the fourth-root uick, i.e., that it is a valid regulator. All the key elements of 
the argument can be confirmed by perturbative calculations and/or concrete numerical tests. 

3. In view of the answers to questions 1 and 2, a third issue arises. Lattice computations are 
performed at non-zero a, and they are thus affected by the non-local nature of the regulator. 
Even though the answer to the second question implies that these unphysical effects go away 
in the continuum limit, one needs to understand them in order to analyze and fit the data 
generated at a ^ 0. Here, this issue is addressed for the physics of GBs. In Sec. 4 we argue 
that the correct effective theory describing GBs is given by staggered chiral perturbation 
theory (SZPT) plus the replica trick [0]. At a pedestrian level, the replica trick refers to the 
fact that sea-quark loops are corrected by a factor of four "by hand," after identifying the 
quark-flow diagrams underlying a SZPT calculation. The replica trick thus implements the 
fourth root at the level of the effective theory. 

Before embarking on the topics outlined above, let us first consider the continuum limit in 
more detail. If the continuum limit exists and is in the correct universality class, it can be (formally) 
described by the path integral (including sources for mesons) 



-'continuum 



(J) = J '9% e - s ^ a//) Det 1 / 4 ((D(^)+M)<g>l+./) , (1.2) 



where S g {%) is the gauge action. The Dirac operator D{%) carries no taste index, and 1 is the 
4x4 identity matrix acting on the taste index. The mass matrix is M = diag(m u ,md,m s ), with 
all masses positive. The source J is a matrix in spin, flavor and taste space. If we project the 
complete set of correlation functions generated by this partition function onto the taste-singlet set 
by setting J = /ig) 1, the fourth root reduces to Det((D(W) +M) +J), and it becomes obvious 
that the taste-singlet sector is the physical sector, containing precisely the correlation functions 
for unquenched three-flavor QCD. This simple observation implies that no paradoxes based on 
symmetry arguments 2 can arise [^]. 



Of course, many unphysical correlation functions exist in the theory defined by Eq. (U\2J), and 
they can be generated by using a source J that is not proportional to the identity in taste space. 
However, because of the exact SU (4) taste vector symmetry of the theory, flavor non-singlet but 
taste-singlet operators can be related to taste non-singlet ones, for example, 

uftd^ uy 5 Ed, (1.3) 



- as presented for instance in Ref. | 
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where S is an SU (4) taste rotation acting (for example) on the down quark. If we choose S = £5, 
the operator on the right-hand side corresponds to the exact GB related to U(l) e symmetry. This is 



the pion field usually used in simulations. Equation ( |1.3| ) tells us that this is equivalent to using the 
physical operator uy^d. However, if one is interested in flavor-singlet physics, in which mixing with 
gluonic states can occur, one must use taste-singlet operators, in accord with the fact that gluons 



do not have taste. Indeed, the trick of Eq. (|L3j) will not work if both the quark and the anti-quark 
have the same flavor. In cases with exact flavor symmetry, it may nevertheless be possible to rotate 
a flavor-neutral interpolating field first into a flavor-charged one, and then make taste rotations as 
in Eq. (1.3). For example, if m u = m c i, we can rotate a taste-singlet 71° field into a taste-singlet 7l + 
field and then use Eq. ( \l.3\ ) to relate it to a taste-nonsinglet 7i + field. However, this will not work if 
m u 7^ mj, when a K can mix with gluonic states. 

2. Non-locality at a 7^ 

It is straightforward to establish the non-locality of the rooted theory at non-zero lattice spac- 
ing. We proceed by assuming the theory to be local, and derive a contradiction Assume that a 
local Dirac operator D exists such that (at 

Det 1 / 4 ^) = Det(D) exp(-5% / /4) , (2.1) 

with SS e ff a local functional of the gauge field which does not contribute to any long-distance 
effects (apart from a possible renormalization of the gauge coupling). Take the fourth power: 

Det(D. SMg ) = Det(D 4 ,) exp(-55 e// ) , D 4t = D <g> 1 . (2.2) 

The operator describes a theory with an exact U (4) taste symmetry. We may now compare the 
spectra of hadron multiplets in the two theories at non-zero lattice spacing, considering for instance 
the pions. The theory defined by D4, has fifteen degenerate pions 3 in the adjoint representation 
of SU(4). But the theory described by D stag is well known to have a spectrum of fifteen non- 
degenerate pions (with only one "exact" pion corresponding to U{\) e symmetry), due to the fact 
that the staggered symmetry group is much smaller than SU (4) It follows that, contrary to our 
assumption, SS e ff has to know about long distance effects! Thus the staggered theory with the 
fourth root is non-local. 

While this concludes our basic argument, it is instructive to discuss this result in more detail. 



In order to do this, we first go to the so-called taste basis, by defining [jlfj 



D; a ] te = a- l + QD- t l ag Q\ (2.3) 
where a is a constant of order I /a, and Q is a (gauge-covariant) unitary matrix connecting the 



one-component and taste bases flllj]. The contact term a 1 is new, but it has no effect on the long 



distance physics. Choosing a finite (instead of infinite, which would turn Eq. ( |2.3| ) into the usual 



basis transformation of Ref. QUO) is advantageous for setting up the RG framework in the next 



section, as will become clear below. With < a < °°, we have that 



Det(D 4 .^) = Det((aGP) Det(D taste ) , (2.4) 
(aG)- 1 = ^D stag + Q t Q= ^D stag + 1 . 



Here we will use the word "pion" for any meson that becomes an exact GB in the continuum and chiral limits. 
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For a ~ l/a, the effective action defined by logDet((aG) -1 )) is local, because (aG) _I is a lattice 
Dirac operator with a bare mass of order l/a. This effective action vanishes as a — > °°. 
We now may split D, ai , e into taste-invariant and non-invariant parts: 

D taste =D 1 ®l + Y,D A ®Z A , (2.5) 

A 

where the S A are fifteen traceless hermitian matrices spanning the taste space. We thus have that 
logDet(D tofe ) =4^061(00 +logDet(l+Df 1 £D A ®S A ) . (2.6) 

A 

Both D\ and the D A are local, but the effective action defined by the last term on the right-hand 
side is not. On the other hand, the theories defined by \ogDet(D taste ) or logDet(Di) separately 
are obviously local. 4 This makes it clear where the non-locality of the rooted theory comes from: 
it originates directly from the taste breaking present in the unrooted theory. But it also makes it 
clear how it may go away in the continuum limit: the Da are of order a, and thus constitute a set 
of irrelevant operators. If no quantum effects change this observation, taste symmetry should be 
restored in the continuum limit, and, mutatis mutandis (as we will argue in the next section), the 



rooted theory should become local. Equation ( |2.6| ) is an example of Eq. ( |2.2| ) with D = D\, but 
with 8S e ff non-local. 

Working out D taste in the free theory in momentum space, we find that 



Ln i[Yn ® +m + ^(p 2 + m 2 ) + k Ln [75 ® ^MP\ 



u taste — , , 2m ■ 1 ' «i . n\ > v Z " ' ' 



in which = sinp^, = 2sin(p jU /2), and p = J^^p^- The last term in the numerator of 



Eq. (2.7)is what removes the fermion doublers in the usual taste-basis action [11] (i.e., for a = °°), 



at the price of breaking the taste symmetry explicitly. The taste-invariant Dirac operator D\ in 



Eq. (2.5) is constructed by dropping that term. An important observation is that D\ also has no 
doublers, because of the Wilson-like term proportional to l/a ~ a in the numerator. This is a 
prerequisite for having a taste-invariant theory in the same universality class as the staggered theory, 
making feasible the comparison of pairs of such theories in Sec. 3. 

At this point, it is useful to reflect on the nature of the non-locality in the case at hand. 5 In 
general, the tendency is to "stay away" from non-local field theories, whereas here we argue that 
in this case there is no need to: the continuum limit is in the desired universality class (cf. next 
section), and the non-local behavior at a ^ can be understood in detail (cf. Sec. 4). In the theory 
with the fourth root, there are "too many" pions. While we should end up with eight pions (in the 
theory with three flavors), before taking the fourth root the theory has many more, because of the 
unphysical, extra taste degree of freedom. Of course, taking the fourth root is precisely intended 
to remove the surplus of pions, by inserting the appropriate number of factors 1/4 into various 
terms making up the correlation functions describing the propagation of the pions. If all pions 
are members of exact taste multiplets, this should work, as it would in the theory described by 



4 Note that when we say an effective action is non-local, we mean that it cannot be written as a fermion path integral 
with any local action. 



5 SeealsoRef. [Q. 
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Eq. (1.2). The problem is that at non-zero lattice spacing taste symmetry is broken, so that within 



one taste multiplet, the pion masses are (to leading order in SZPT) given by 

K) 2 = (mf) 2 + cVA 4 ecD , (2.8) 

in which GB denotes the one exact Goldstone boson, and the c^ 4 are numerical coefficients which 
do not vanish except when the taste index A refers to the exact GB. This "mismatch" between 
masses shows up as violations of unitarity, a manifestation of the non-locality of rooted staggered 
fermions at non-zero lattice spacing. For explicit examples, see Refs. [gj, |l^0|. We note that these 
arguments extend to other hadron multiplets as well; we just focused on the pion sector because it 
contains the lightest excitations in the theory. 

One way to rephrase this observation is by noting that there are two independent infrared (IR) 
scales in the theory, one being the physical pion mass m% B governed by the quark mass, and the 
other aAg CD , which is generated by the taste splitting of low-lying eigenvalues of the staggered 
Dirac operator. In other words, there are two different scales which control the IR behavior of the 
theory, a physical one and an unphysical one. As is often the case, the order in which IR scales are 
taken to zero matters, and in this case it is clear that the right order is to first take the unphysical 
scale aAg CD to zero if one wishes to study the chiral limit [JT5], |0. Of course, taking aA^ CD to zero 
is done by taking the continuum limit. In addition, it is clear that, in view of the unitarity violations 
at a ^ 0, one should also take the continuum limit before continuing the theory to Minkowski space. 

3. Renormalization- group analysis of (rooted) staggered fermions 



The (thus far formal) argument based on Eq. (|L2J) says that rooted staggered fermions provide 
a valid regularization of QCD if exact taste symmetry is recovered in the continuum limit. We will 
now build an adequate non-perturbative framework where well-defined statements can be made 
about the continuum limit. Using this framework we will argue that under plausible, and testable, 
assumptions, rooted staggered fermions indeed provide a valid regularization of QCD. 

3.1 Strategy 

In Sec. 2 we discussed how taste-splittings manifest themselves in physical observables. We 
now take a step back and examine taste-symmetry violations at the most fundamental level: in the 
spectrum of the staggered Dirac operator. On gauge-field configurations drawn from a (rooted or 
unrooted) dynamical ensemble, what one expects to find is that the low-lying eigenvalues arrange 



themselves nicely into almost-degenerate taste quartets [16, 17]. But for larger eigenvalues the 
taste symmetry deteriorates until, finally, at the cutoff, taste degeneracy is completely lost. 

If taste symmetry must be lost at the cutoff scale, let's get rid of all cutoff-scale physics. The 
way to eliminate cutoff-scale effects is to apply renormalization-group (RG) block transformations. 
With each blocking step the lattice spacing is doubled, until an effective theory on a coarse lattice 
with spacing a c is reached. As we approach the continuum limit we fix the QCD scale Aqcd 
and the renormalized quark masses by adjusting the bare parameters. We fix the coarse-lattice 
spacing a c too: each time we make an additional blocking step, we simultaneously decrease the 
fine-lattice spacing a* by a factor of two. The limit of infinitely many blocking steps implies the 
usual continuum limit, af — > 0. The coarse-lattice spacing provides a new, intermediate distance 
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scale, a/ <C a c Ag^. RG-blocking eliminates from the theory all the fermion modes with eigen- 
values (well) above the coarse-lattice cutoff l/a c . The remaining eigenmodes, those of the blocked 
Dirac operator, all have eigenvalues that become vanishingly small in units of the underlying cutoff 
1 jaf. Because a c is kept fixed, we expect that all these eigenvalues will uniformly show the quartet 
structure required by taste symmetry. In other words, the blocked staggered Dirac operator will ac- 
quire the taste-diagonal form of Eq. ( L2) in the continuum limit, and its fourth root will correspond 
to a local one-taste theory. 

There are several reasons for choosing a c <C ^-qcd- Fi rst > we ta ke as criterion for the theory 
in the continuum limit to be local the requirement that the coarse-lattice action be local on the 
scale a c . This only makes sense if a c -C ^qcd- We a ^ so want tne complete set of coarse-lattice 
observables to be rich enough to extract all of the QCD physics. Again this requires that l/a c will 
be a high-energy scale relative to the QCD scale. A third reason will be encountered below. 

The blocking framework constructed below is designed to make the most out of our under- 
standing of uncontroversial lattice regularizations of QCD and, in particular, of unrooted stag- 
gered fermions. A fairly standard blocking framework is set up for ordinary (unrooted) staggered 
fermions in a form where the adaptations required for rooted staggered fermions are minimal. 

The novel element is the introduction of comparison blocked-lattice theories with exact taste 
invariance. At each blocking level, a new theory with exact taste symmetry is constructed by simply 
dropping the taste-breaking part of the blocked Dirac operator. This is equivalent to dropping the 
rightmost, non-local term of Eq. (2.6), except that it is done after a blocking process. In the rooted 
theory, the result will be that each of the so-obtained reweighted theories is a local one-taste theory. 
Of course, while all by itself RG-blocking leaves the physics invariant, the reweighted theories 
constructed at different blocking levels are different from one another, as well as from the original 
(rooted or not) staggered theory. 

Taste-breaking effects that survive blocking become smaller and smaller with each additional 
blocking step. As a result, at each blocking level, the difference between a (blocked) staggered 
theory and the corresponding reweighted theory gets smaller. In the (continuum-)limit of infinitely 
many blocking steps, the difference vanishes for every observable. 6 Because each reweighted 
theory is local and belongs to the correct universality class, the same applies to rooted staggered 
fermions in the continuum limit. 

Clearly, this amounts to a set of highly non-trivial claims which require a detailed justification. 
Several key steps of the argument basically work in the same way for the unrooted and the rooted 
staggered theories. Note that for the ordinary (unrooted) theory, they lead to the uncontroversial 
conclusion that the continuum-limit theory consists of four degenerate quark species per staggered 
field. The last crucial step of the argument is, however, more complicated in the rooted theory. 



3.2 The RG blocking framework 

We first introduce our notation. We will perform n + 1 blocking steps labeled k = 0, . . . , n. 
The k th lattice spacing is a& = 2 k+i af where as already mentioned cif is the fine-lattice spacing and 
a c = a n is the coarse-lattice spacing. The k = step, already discussed in Sec. 2, is special. It 
transforms the staggered field from its usual one-component basis to a taste basis, which is then 

6 It is necessary to assume that m ^ for all flavors. 
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retained in all subsequent blocking steps. Thinning out of the fermion and gauge-field degrees of 
freedom occurs at each step, except for the k = step where the fermions are not thinned out but 
roughly speaking only undergo a change of basis. In order to avoid overly cluttered notation we 
will consider a singe-flavor theory in this section. The generalization is trivial. 

We first set up the blocking framework for the ordinary (unrooted) staggered theory. 7 Blocking 
in the rooted theory will be introduced later. The unrooted partition function is 

Z = J ®fy®x®X zM-Sg-JDstagX) , (3-1) 

where x{ x )iX{ x ) i s a one-flavor staggered field, and denotes the link variable. The gauge 
field as a whole will be denoted = {U^^}. Again S g = S g (^ ) is the gauge action and D stag = 
Dstagi^ ) is the staggered Dirac operator. 

The coordinates of the k th step blocked lattice will be denoted x^ k \ The fermion and anti- 
fermion fields on that lattice are Ya! ( x ^ ) an( ^ ¥ai ( x ^ ) respectively. The indices a and i, both 
ranging from one to four, are the Dirac and the taste index respectively. The blocked link variables 
will be denoted V^ k \ k) , and the blocked gauge field as a whole = {V^L, }. 

RG-blocking is performed by multiplying the integrand of the path integral by one, written in 
a sophisticated form, and then interchanging the order of integrations. Fermion blocking is always 
done with a gaussian kernel that, for k > 1, takes the explicit form 



1 = 



J exp [a k (j {k) -rff^Q^) (v W - G ( V* -1) ) 



(3.2) 



Here Ofe is a blocking parameter of mass dimension one, that is naturally taken to be 0(a^ 1 ), and 
Nk is the number of sites of the k th lattice. The blocking kernel Q^> = Q( k ) (y( k ~ 1 )) is ultra-local 
and gauge covariant. It defines a linear mapping from the sites of a 2 4 hypercube on the (k — l) th 
lattice to the corresponding single site of the & th lattice. Apart from the k = step (Sec. 2), acts 
trivially on the Dirac and taste indices. For the gauge field we assume a conventional ultra-local 
blocking kernel whose details are not needed. 

We begin by introducing the kernels for the n + 1 blocking steps. However, we do not integrate 
over any gauge field yet, for a reason that will become clear shortly. We do integrate over all the 
fermion fields except those living on the coarse lattice. This can be done in closed form because 
the fermion integrals are gaussian. The result is 

z = s>yw s>y^ ■ ■ ■ ®yw ex P ( - s g - £ - £ s^j\ 

x J dY {n) dW (n) exp ( - W {n) D n W {n) ) ■ (3-3) 

Here J^ 0) = and J^ k) = J^^),^" 1 )), 1 < k < n, are the gauge-field 

blocking kernels. The UV fermion modes that have been integrated out at the k th step give rise to 
the effective action 

S^ = logdet(G t ). (3-4) 

7 We assume a fixed, finite physical volume. By assumption all quarks are massive, and we expect no subtlety when 
taking the thermodynamical limit. 
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The operators D k and G k 1 may be constructed iteratively (for the k = step see Sec. 2) 

D k = a k -a 2 k Q^G k Q^ , (3.5) 
G^ = D k _ 1 + a k Q^Q^. (3-6) 

Let us briefly summarize the important properties of these operators in the free theory [|l(]]. 
The operator D k is gap-less for m = 0, with its low-lying eigenmodes coming from p ~ 0. In 
Eq. ( |3.6[ ) the addition of the blocking-kernel term lifts the small eigenvalues and creates an 0((X k ) = 
0(l/a k ) gap in the spectrum of . One can then prove iteratively that D k , G^ , and G k are all 
local operators [18]. By this we mean that their matrix elements connecting sites jeW and y( k > 
vanish exponentially with the separation, with an 0{a k ) decay length. 

Expanding the free, coarse-lattice Dirac operator D n to second order in p and m gives 

D n {p)=m + i\jf®l}+a f Y}Y5®^]pl-Rn{m + i[p l ®\]f + ■■■ . (3.7) 
where R n = T! k =o(^)" k / a k * s 0{a c ). The shown taste-breaking term originates from the last term 



in the numerator of Eq. (2.7). Because \p\ < a~ , this irrelevant term is 0(a//a 2 ), which is just 
the scaling required by its engineering dimension. In the limit n — > oo the blocked Dirac operator 
becomes taste-diagonal. For m = 0, the operator D n satisfies a Ginsparg-Wilson (GW) relation. 
Thus, through RG blocking, the taste-violating mechanism for removing the doublers is gradually 
taken over by the taste-conserving and chiral symmetry conserving GW mechanism. 

In both the free and the interacting theories the inverse blocked Dirac operator satisfies 8 

D? = a£ x +Q®D£ X QW. (3.8) 

Using Eq. ( |3T8| ) recursively we may express the coarse-lattice fermion propagator as the fine-lattice 
propagator between smeared sources defined by the product of the blocking kernels. 9 Equation 
(^8) is in fact a special case of a completely general pattern. Considering any operator con- 



structed from the coarse-lattice fields, we may "undo" the blocking by performing the integration 
over all the blocked (gauge-field and fermion) fields we have introduced, resulting in an opera- 
tor &(f> that depends only on the fine-lattice fields. This operation defines a pull-back mapping 
; 0(c) _^ 0(f) _ The pull-back mapping is ultra-local (because the blocking kernels are) and 
it preserves expectation values: ^(f' c > 0( c >y = (^M^. 

The existence of the pull-back mapping means that every coarse-lattice observable is at the 
same time also a fine-lattice observable. This "kinematical" feature has an important dynamical 
consequence: The coarse-lattice observable must preserve the constraints that follow from all the 
fine-lattice symmetries, even if (as it actually happens) some of these symmetries are not manifestly 
preserved by the blocking (for more details, see Ref. pi]). 



We next introduce a family of comparison theories as follows. First, as in Eq. (2.5) we split 
the blocked Dirac operator into its taste-invariant and taste-breaking parts 

£>„ = D„®1 + A n , (3.9) 



For the k = step, Eq. ( 3.8 ) reduces to Eq. (Ej) 



9 This is true up to the contact term in the propagator, which vanishes for ^ . 
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where D n carries no taste index, and A„ is traceless on the taste index. The replacement A„ — > tA n , 
where < t < 1, then allows us to gradually "turn off" all the taste breaking. For t = 0, exact 
taste symmetry is restored by hand. Performing this at the level of the (blocked) partition function 
generates a family of interpolating theories 



z^ er {t) = [ ®t (0) ■ ■ ■ ®r (n) exp ( - s g - £ - £ s; 

J V k=0 k=0 

x J d\i/ (n) dW {n) exp -Ij/^fDn^l + tA,^ y/") 



(3.10) 



In the next subsection we will argue that these interpolating theories are all local. They clearly 
belong to the same universality class as the original staggered theory. 
We now come to the rooted staggered theory defined by 

z root = jgty exp(-Sg) Det 1/4 (D,.,^) . (3.11) 

The gaussian fermion-blocking transformations readily lead to identities for the fermion determi- 
nant that hold for any given set of values of the (original and blocked) gauge fields. This allows us 



to write down the analogue of Eq. (3.3) for the rooted theory 



Z root = I Qfy (0) (1) . . . & y(n) exp ( - - £ _ 1 £ ^k)\ Det l/4 ^ _ (3 U ) 

' V k=Q ^ k=Q J 



Of course, as already noted in Eq. (2.6), after «-step blocking we still cannot express the rooted 
determinant as a local path integral. This is made possible only after dropping the taste-breaking 
part A n completely. We then arrive at a reweighted theory 

1 v~ K 

^eff 



z reweigh = j Qty @y(0) . . . @y{n) exp _ £ S ^ f 

■' V k=0 4 k=0 

x J dq^dq^ exp (^-q^D n q^ , (3.13) 

where we used Det 1 / 4 (D M ® 1) = Det(D„). The coarse-lattice quark fields q( n \ q^ carry no taste 
index. In the next subsection, we will argue that the reweighted theories Z^™"' 8 are local too, and 
that they fall into the right universality class. (This puts on a solid basis the observations made 



below Eq. (1.2).) For rooted (unrooted) staggered fermions, we will show that each observable of 
the n th reweighted (interpolated) theory has the same n — > oo limit as the corresponding observable 
of the blocked staggered theory. 

3.3 Validity of rooted staggered fermions in the continuum limit 

Ordinary staggered fermions define a renormalizable lattice theory. The renormalizability of 
the ordinary staggered theory has not yet been established to all orders. Yet there is no real reason 
to doubt that all-orders renormalizability holds in the unrooted theory. As shown in Ref. [^J, any 
relevant or marginal term not already present in the original staggered action is forbidden by its 
symmetries, and thus will not be induced by loop corrections. 

Renormalizability readily generalizes to the rooted theory. This was first observed in Ref. [p^]. 



For a recent detailed discussion, see Ref. [12]. Consider first a theory with copies, or replicas, 
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of identical staggered fields. The counterterms at any order are polynomials in (the integer) hr. 
Now, in perturbation theory, rooting amounts to multiplying each staggered-fermion loop by 1/4. 
Therefore the same counterterms will suffice to renormalize the rooted theory when we substitute 
iir — > 1/4. While locality is lost with rooting, renormalizability is saved. The theory remains 
tightly constrained by power counting and symmetries. 

Our third motivation for choosing the coarse-lattice spacing to satisfy a c <C A -1 , is that QCD is 
weakly coupled at short distances. The existence of a weak-coupling regime depends on renormal- 
izability, but it does not necessarily require locality. 10 Any four-dimensional lattice gauge theory 
whose Boltzmann weight contains a fermion determinant raised to a real positive power hr will 
have a weak-coupling regime, so long as the one-loop beta function (that depends linearly on ur) 
remains asymptotically free. All we need for the next step is that the blocking process in its entirety 
is taking place in a weak-coupling regime, in both the unrooted and rooted theories. 

(k) 

Next, we argue that the blocked Dirac operator D n and the effective action S e L (obtained 

by removing short-distance fermion modes) are local, and that this is true on both the unrooted 

(k) —i 

and rooted ensembles. Functional derivatives of S e L depend on H k and on (derivatives of) Hk, 
where the operator Hk = [75 <8> ^5]G k l is hermitian In view of the discussion below Eq. (3.6). 



the necessary and sufficient condition for 5^ to be local is that both Hk and its inverse be local 
operators on the lc th lattice scale, in the interacting theory too. Unlike in the free theory, however, 
we cannot rule out the possibility that Hk has no gap in the interacting theory, i.e., that Hk may 
have arbitrarily small eigenvalues. But for the locality of HZ small eigenvalues are harmless if the 
eigenmodes are exponentially localized on the corresponding lattice scale. Therefore, a sufficient 
condition for the locality of H7 , and iteratively for the locality of and Hk+\, is that the mobility 
edge of Hk be 0((Xk) = 0(1/ ak)- Eigenmodes with eigenvalues above the mobility edge are, by 
definition, extended. But as long as the extended-modes spectrum does not reach down to zero, the 
inverse of Hk will stay local. 

A recent study of the mobility edge of the Wilson operator in the super-critical region reveals a 
mobility edge remarkably close to the free-theory gap for moderate values of the bare coupling [pC|]. 
By interpolation, the mobility edge must be even closer to the free-theory gap in a (really) weak- 
coupling regime. Intuitively this can be understood as follows. Eigenvalues far below the free- 
theory gap arise in the presence of "dislocations" in the gauge field. But the pure Yang-Mills action 
on the lc th lattice strongly suppresses dislocations provided that the running coupling g r (ak) is very 
small, in which case the effect of any fermion determinant is subdominant (the large, UV divergent 
part of the fermion determinant is absorbed into the renormalization of g r (cik))- This works in any 
renormalizable and asymptotically free (but not necessarily local) theory, in particular for both the 
ordinary and the rooted staggered ensembles. Having said that, since none of the operators Hk has 
been studied numerically to date, it is clearly important to confirm this part of the argument. 



If D n is local then, trivially, D n and A„ are separately local, cf. Eq. (3.9). Integrating over 



all the gauge-fields except Y^ 1 ' in Eqs. (3.10) and (3.13), we obtain coarse-lattice theories which 



are local. In particular, the reweighted theories (3.13) are one-taste theories that belong to the 
appropriate universality class. 

What remains to be established is that the sequence of reweighted (interpolating) theories has 
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See Ref. | p^ ] for a discussion of non-local but renormalizable two-dimensional field theories. 
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the same n^>°° limit as the blocked rooted (unrooted) theory. We now derive this result assuming 



that, in both the interpolating theories ( p-lOj ) and the reweighted theories ( ]3 . 1 3| ) , the following 
scaling relations are valid in an ensemble average sense: 



D 



-l 



1 



< 

~ m r (a c ) 



a f 



l|A„|| < , 



2 -(n+l) 



(3.14) 
(3.15) 



where m r (a c ) > is the renormalized quark mass. The scaling law < ]3. 1 5| ) of the taste-breaking part 
A„ of the Dirac operator neglects logarithmic corrections. It is the anticipated scaling based on the 
smallest engineering dimension of an irrelevant operator, namely five. 1 1 

In the rest of the argument we focus on the rooted theory. We compare the rooted and the 
reweighted theories at the n th blocking level starting from one-taste operators constructed from the 
coarse-lattice fields of the reweighted theory, i.e., operators of the form ff^ = ff^ n \q^ n \q^ n \'f'^). 
For such operators, we reconstruct rooted observables, using that un-normalized expectation values 
satisfy 12 



exp 



4 

\ reweigh 



trlog(l+A n [z5„(g>l] 



reweigh 



(*w)~-(i +0 tf» 



(3.16) 



where we have used the taste-tracelessness of A„, and where, using Eqs. (|3J4j) and ( |3.15| ), 

a f 2-(" +1 ) 



< 



a^m r (a c ) a c m r (a c ) 



(3.17) 



The (anticipated) scaling of A„ thus implies that e„ — > for n —y oo. The observation here is that, 
first, after sufficiently many blocking steps the expansion of the logarithm is convergent, and sec- 
ond, that in the limit n —* oo the expectation value of is the same for the rooted and for the 
reweighted theories. 13 This is precisely what is needed for the validity of the rooted theory in the 
continuum limit! A corollary is that the continuum-limit theory enjoys every exact symmetry that 
exists in the (rooted) staggered and/or in the reweighted theory. 

3.4 Scaling 

Let us now take a closer look at the scaling laws. A standard perturbative treatment in either 
the unrooted or the rooted theory would predict all the scaling laws we have used: for the running 



1 1 See Ref . [ 2j]] for a first direct study of the scaling of A„ . 

12 Eq. (3.10 only relates a subset of the observables in the rooted theory — those in the physical subspace — to those 



of the reweighted theory. If we want to relate all rooted observables, we can (a) leave the reweighted theory as a theory 
of four rooted, but equivalent, tastes, as was done for the continuum theory in Eq. (1.2), or (b) write the reweighted 
theory as a theory of four unrooted tastes and three ghost tastes, as in Appendix B of Ref. Jl"^. 

13 Strictly speaking, one should take into account the fact that the valence propagators coming from ff( n > are also 
slightly different on the two sides of Eq. (3.16): The rooted theory keeps taste-violating terms in these propagators but 
the reweighted theory does not. The difference is also O(e^) and thus not important to the argument. 
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of g r and m r , and for the scaling of taste-breaking irrelevant terms. But this falls short of what we 
need. 

Scaling laws for irrelevant operators are usually derived in a Symanzik effective-action context 



[]22[], where one imagines integrating out the short-distance fluctuations of all the lattice fields. 
Here, in contrast, the scaling laws must hold in Eq. (3.13) (or inEq. (3.10)), where a whole "tower" 
of gauge fields is still present. How do we know that any scaling law still applies? We have only 
assumed that the scaling laws ( 3.14 ) and ( 3.15 ) hold in an ensemble average sense. Furthermore, we 
have only assumed these scaling laws within the reweighted theories, and these have a local path- 
integral representation. We do not rely on the validity of any scaling laws for irrelevant operators 
directly in the rooted theory. 

Operationally, this means that the scaling laws must hold for the expectation values of oper- 
ators constructed from the coarse-lattice fields of the reweighted theory. Before we can compute 
any scaling law, we must first set up perturbation theory. Usually lattice perturbation theory begins 
with the expansion of the link variables as = exp^gaA^ x ). In a reweighted theory a similar 
expansion will have to be applied to the tower of gauge fields ffl, V^ ' , Y^' , simultane- 
ously. With this, the perturbative expansion can in principle be derived directly from Eq. ( 3.13 ), 

~ (k) 

because the closed-form expressions for D n , D n and S e ^ as functionals of all the gauge fields are 
known. One would then proceed to calculate correlation functions with coarse-lattice fields only 
on the external legs. Because only coarse-lattice gauge fields occur on any external leg, we may 
imagine that the integrations over <W , . . . , 'fv l ~~ x > are always done before the integrating 

over y( n \ Since, in addition, all the external momenta are of order \/a c , this brings us closer 
to a standard RG setup where coarse-lattice observables are computed using a conventional (but 
complicated) coarse-lattice action (that includes for example multi-fermion interactions). As we 
explain below, some differences still remain. But first we turn to a conceptual question. 

There is a crucial difference between the running of g r and m r , and the anticipated scaling of an 
irrelevant operator. The running of relevant and marginal parameters originates directly from the 
short-distance divergences of the theory. Because power-counting and renormalizability survive 
rooting, we have no reason to doubt the validity of rooted perturbation theory for these specific 
scaling laws. This is especially true given that only general features of the running coupling and 
mass parameters are needed, but not any details. 

In contrast, the anticipated scaling of any irrelevant operator (which does not mix with relevant 
or marginal operators) implies the vanishing of that operator in the continuum limit. That some- 
thing should vanish in a certain limit is a more delicate claim. In local theories we have no reason 
to doubt the perturbative prediction, which amounts to the assumption that no non-perturbative 
effects interfere with the vanishing of all amplitudes with an insertion of the irrelevant operator. 
But, the question is, how can we be sure that the non-locality of the rooted theory does not modify 
the scaling of irrelevant operators in undesirable ways not captured by perturbation theory? Once 
again, our solution is to rely only on the predictions of perturbation theory in the local reweighted 
theories. Because the non-locality has been eliminated, we see no reason to trust reweighted pertur- 
bation theory any less than perturbation theory for, say, unrooted staggered fermions. This applies 
in particular to the scaling of (taste breaking) irrelevant operators within the reweighted rooted 
theory, which, we expect, would give rise to Eq. (3.15). 

In the unrooted theory, an analogous analysis starts off with the reweighted unrooted theory, 
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i.e., the t = interpolating theory (cf. Eq. (3.10)), and ends with the uncontroversial conclusion 
that exact taste symmetry is recovered in the staggered theory in the continuum limit. But, in the 
unrooted staggered theory we could instead rely directly on unrooted perturbation theory for the 
scaling of A„, because the unrooted theory itself is local. 

Within reweighted unrooted perturbation theory, the scaling (3.15) should hold as we go down 
from l/af to \ja c . This, in turn, should result from the similar symmetry patterns of the stag- 
gered and the reweighted theories, for n large enough. On the staggered side, both taste symmetry 
and (softly broken) chiral symmetry in the continuum limit are secured by the lattice symmetries. 
On the reweighted side, we have exact taste symmetry by construction. But if the continuum limit 
is to come out right, then the reweighted theory should in addition have an approximate chiral 
symmetry (namely, the additive renormalization of the taste-singlet mass term goes to zero with in- 
creasing n). Indeed, since the t = and t = 1 theories in Eq. ( 3.10 ) are connected by a convergent 
expansion, this must be the case. 

We can summarize our line of reasoning as follows. First, we claim that the reweighted version 
of the unrooted staggered theory is local, and that A„ scales as expected in that theory. If one accepts 
the validity of the unrooted staggered theory this claim can be considered "safe." Then, proceeding 
from the unrooted reweighted theory, which has four tastes and exact U (4) taste symmetry, we 
can consider the theory in which we take the fourth root of the fermion determinant, and obtain 
a local one-taste theory in which A„ still scales as an irrelevant operator. Of course, the gauge 
ensemble is different for the reweighted versions of the rooted and unrooted theories. But A„ 
should remain irrelevant as we move from the unrooted reweighted to the rooted reweighted case 
since perturbation theory should be equally trustworthy for both local theories. Note that this is a 
claim about reweighted theories only, with no reference to the underlying staggered theory. In the 
final step, the rooted staggered theory is reconstructed with the help of Eq. ( |3.16| ), and the scaling 
of A„ in the rooted reweighted theory ensures that the rooted staggered theory has the desired 
continuum limit. 

Finally, we address a more practical issue. We have argued that QCD is correctly described 
by the observables of the coarse-lattice (rooted, staggered) theory in the limit of infinitely many 
blocking steps. Thanks to the pull-back mapping every coarse-lattice observable is at the same 
time a fine-lattice observable and, as such, it can in principle be computed directly on the (rooted) 
staggered ensemble. This amounts to using smeared staggered fermion sources constructed in a 
particular way from all the blocking kernels. The question is, are we allowed to use any other 
(local) fine-lattice interpolating fields, as is done in practice? Normally, one proves that once 
renormalization factors have been correctly taken into account, different interpolating fields must 
give rise to the same physical predictions. But once again we face the difficulty that, because of the 
non-locality, it may be dangerous to rely on rooted staggered perturbation theory. We believe that, 
here too, the resolution is to rely on reweighted perturbation theory. By its very construction, the 
reweighted theory "knows" about fine- and coarse-lattice fields alike, and, therefore, it is capable 
of comparing coarse-lattice and fine-lattice interpolating fields. We expect that the usual statements 
about independence of the physical predictions of any particular choice of the interpolating fields 
will remain true once this comparison is carried out. This can in principle be tested in perturbation 
theory. 



14 



Note that this does not amount to a standard blocking process within the reweighted theory. 
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4. Staggered chiral perturbation theory 

Here we discuss point 3 of the Introduction, namely the low-energy effective theory for stag- 
gered quarks. By definition, such a theory must include discretization effects, in particular taste- 
violations, although we expect the theory will go over into continuum chiral perturbation theory 
(ZPT) in the a — > limit. When the underlying staggered action is unrooted, finding the effec- 



tive theory is straightforward [23, 24]. We call the result "unrooted staggered chiral perturbation 
theory," or uSZPT. On the other hand, the effective theory must be non-trivial in the presence of 
rooting, since we know from Sec. 2 that non-localities are present, and that their effects show up in 
the pion sector. 



In Ref. [24], it was proposed that the rooting could be taken into account at the chiral level 



by locating the presence of sea-quark loops in the meson diagrams of uSZPT and multiplying 



each by 1/4. These loops can be found by quark-flow arguments []25|], but for present purposes 
it is more useful to use the replica trick [p6|], which was already employed in Sec. 3.3 for weak- 
coupling perturbation theory. This version of staggered chiral perturbation theory will be referred 
to as rSZPT. We emphasize that the "r" in rSZPT stands, in the first instance, for "replica." The 
rules of rSZPT, described in more detail below, give a well defined procedure for computing chiral 
amplitudes. The question is, however, whether rSZPT is in fact the proper low-energy theory for 
rooted staggered quarks. We will argue that it is correct; in other other words, we argue that the "r" 
in rSZPT also stands for "rooted." 

Finding the correct chiral theory is important for several reasons. The discussion in Sec. 2 
indicates that the potential problems from rooting show up in the IR, because of the interplay of 
the physical (m^ B ) and unphysical (aAg CD ) IR scales. 15 Since the chiral theory describes the most 
IR part of the theory, it provides a laboratory for studying the effects of non-locality and seeing 
(one hopes) how they go away as a — > 0. Indeed, as mentioned in Sec. 2, rSZPT already shows 
unitarity violations at a ^ [ 13, 14], ^J. Moreover, even if the validity of the rooting procedure were 



rigorously established at the quark level, the chiral theory would still be crucial for controlling the 
chiral and continuum extrapolation of simulation results [p7|]. Since the effects of taste violations 
are significant at lattice spacings available currently and in the foreseeable future, we cannot do 
without a chiral theory that takes such effects into account. Finally, the chiral theory provides a 
non-perturbative handle for relating the unrooted staggered valence sector to the rooted staggered 
sea sector. In particular, if rSZPT is indeed correct, it can be used to show that even though the 
valence sector is unrooted, the theory does not behave as a "mixed" theory in which valence and 
sea quarks have different lattice actions. 

The uSZPT theory for a single unrooted staggered field (i.e., one flavor) is derived through 
0(a 2 ) by Lee and Sharpe [p3|[. They start by using the staggered symmetries to find the Symanzik 
action through &(a 2 ). Spurions can then be introduced for all terms that violate the taste and chiral 
symmetries, and the chiral theory follows. The generalization of this procedure to multiple flavors 



appears in Ref [24]. In the unrooted case, the generalization is straightforward. The proposal to 



take into account the fourth root, is, in the replica approach, the following: 



I5 In the RG analysis of Sec. 3, it is p lausib le that e„ (Eq. ( p.l'fl ) over-estimates the relative size of successive terms 
in the expansion of the logarithm in Eq. (3.16), and that the relative size is actually aA^ CD /m r ~ aA^ CD / (m^ B ) 2 . This 



ratio indeed contains the two IR scales m^ B and aA^ CD . 
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• Replicate the sea-quark degrees of freedom at the chiral level, replacing each by ur identical 
copies, where hr is a positive integer. All hr copies get the same mass, and, if relevant, the 
same source terms. 

• Calculate order by order in the resulting uSZPT, keeping the tir dependence explicit. 

• Replace hr by 1/4 at the end. 

Note that the dependence on hr is completely determined at any finite order in uSZPT, so replacing 
wr by 1/4 is a well-defined procedure. 16 As always in a chiral theory, we treat the low-energy 
constants (LECs) as free parameters for each n R ; we are just trying to find the dependence of 
physical quantities on the LECs in the rooted staggered theory. 

The argument []7|] that rSZPT is the correct chiral theory for rooted staggered quarks stays 
completely within the context of chiral theories. As emphasized by Sharpe [12], this is possible 
because the unfamiliar unphysical features of the rooted theory can be connected to the much 
more familiar, but still unphysical, features of a partially quenched (and unrooted) theory. While 
PQZPT, the chiral theory in the partially quenched case |[T^], is not on as firm a theoretical footing 
as is ordinary ZPT for full (unquenched) theories [28], we have a large body of numerical evidence 
that PQZPT (and the completely quenched version QZPT [29, 25]) are in fact the correct chiral 
descriptions of the corresponding lattice theories. The evidence comes from Wilson, domain wall, 
and (for the quenched case) overlap quarks, as well as staggered simulations. Furthermore, PQZPT 
will be subject to even more stringent numerical tests in the future. So connecting the rooted case 
to the partially quenched case is a useful step forward. 

Reference [0] starts by noting that we know (trivially) how the fourth root works when there 
are four degenerate flavors (tip =4). Since the fourth power of the fourth root reproduces the 
determinant, the rooted np = 4 theory is identical to the unrooted np = 1 theory. We therefore know 
the starting chiral theory, namely the uSZPT of Lee and Sharpe. To get to a non-degenerate 4- 
flavor theory, we can expand around the degenerate, massive point; this is where partial quenching 
is needed. Finally, to get to a theory with np < 4 flavors, one quark at a time can be decoupled. 
Each step in the procedure requires some assumptions, which are plausible but unproven. However, 
most of the assumptions can be tested numerically, and some already have been tested. 

To explain the argument in more detail we need some notation. Let {np ,np ,ur)pqcd be the 
generating functional for a lattice QCD theory with np flavors, nj tastes, and nR replicas of each 
flavor; let (np,np,nR) x be the generating functional of the corresponding chiral theory. Whenever 
nR is shown explicitly it is taken to be a positive integer; the end result of the replica trick is 
indicated by replacing nR with 1 /4. When nR is trivially equal to 1 (because the replica trick is not 
relevant), it is omitted. An unrooted theory is indicated by nj = 4; while for the rooted theory we 
put nj = 1. 

Thus (1,4)lqcd is the theory of a single unrooted staggered field, and (l,4) z is the uSZPT of 
Ref. p3[]. Similarly, (np,A,nR) x is the uSZPT of Ref. [ ]2"4| ] with nR-np sea-quark species. Further, 
(np , \)lqcd is the lattice theory of np rooted staggered fields, and (np , l) x is by definition the low- 
energy theory generated by integrating out all the higher modes in (np ,1)lqcd- As far as we know 



16 In Ref. 101, it was claimed that the dependence on is strictly polynomial. That is in fact an oversimplification, 
since we would like to sum the geometric series of hairpin diagrams to all orders, which introduces factors of »# into the 
masses of flavor-neutral mesons, and hence into the denominators of Feynman integrands. But the key point remains: 
the functional form of the iir dependence is completely determined at any finite order in uSCTT. 
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so far, (rip, 1)* could be horribly non-local, non-unitary, or otherwise sick. The claim, however, is 
that (rip, 1)* is in fact (ftf,4, \) x to any finite order in chiral perturbation theory, where the latter 
simply defines what we mean by rSZPT. 

The first steps of the argument are taken in the np = 4 case. We want to show that 



(4,1)* = (4,4,* 



4>X ' 



(4.1) 



where "=" is used to indicate that the two sides are the same functions of the LECs. As mentioned 
above, we start with a degenerate np = 4 theory, with mass matrix ^# = ml, where I is the identity 
matrix in flavor and taste space. In such as theory, we have 



(4,l)lflCD 



(=ml 



(=ml 



{^A)lqcd 

;i,4)x 



(4,4, I 



4)X 



(=ml 



(4.2) 
(4.3) 



The last equivalence here is manifest order by order in rSZPT, since taking Aur degenerate flavors 
and then putting hr = 1/4 gives the same chiral expansion as in a one-flavor theory. 

To move away from degenerate limit, we add taste-singlet scalar sources s'-i for the sea-quark 
fields: 



JSf (4)1) = ...+mV i (x)V i (x) + V i {x) S ij (x)V j (x) + ... 
JSf(4, 4 ,n s ) = ... + m%(x)X(x)+%{x)s i i{x)^ j (x) + ... , 



(4.4) 



with implicit sums over the flavor indices i,j and the replica index r. In the (4, \ )lqcd theory the 
fourth root is taken after the above sources have been included in the determinant. 

When s ^ 0, (4,4, \) x might not be the right chiral theory, so we define the mismatch by 



{A,\;s) x = {4A\;s) x +V[s\. 



(4.5) 



The mismatch V [s] can be expanded around s = 0, where it is known to vanish. In fact, it can be 
shown [Q] that all derivatives of (4, l;s) x and (4,4, \ ;s) x with respect to the sources are equal: 



n 



„ A ds i "jn(x n ) 



which implies 



(4,1;*) 



n 



v=0 



n 



ds'»}"(x n ) 



ds l "j"(x n ) 



V\s 



(4,4,; 







(4.6) 



(4.7) 



s=0 



Equation ( }4.6[ ) is proved by first relating sea-quark correlation functions in each theory to partially 
quenched valence-quark correlation functions, where the valence quarks have the same lattice ac- 
tion as the sea quarks. This then allows us to set the sea-quark source s = 0, where the equivalence 
of the two chiral theories is known. The proof requires the existence of standard PQXVT for un- 
rooted theories, namely for (1,4)* and (4, 4, «/?)*. We note that partially quenched theories are 
crucially needed because the derivatives with respect to sea-quark sources in a rooted theory result 
in different factors of 1/4 in different contractions, which can never happen in the sea-quark sector 
of an unrooted theory. 
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Thus all derivatives of V [s] vanish at s = 0. If V [s] is analytic in s — up to possible isolated 
singularities — it must vanish everywhere, and (4, l;s) x = (4,4, \;s) x . Since an arbitrary four- 
flavor mass matrix can be obtained by a suitable choice of s, Eq. ( |4.1[ ) is verified. In other words, 
rSZPT (i.e., (4,4, j) x ) is the right chiral theory in the four-flavor case. 

The assumption of analyticity could fail in a two ways. First of all, there could be a phase 
transition when the masses are some finite distance away from the degenerate point. There is 



some evidence from simulations [ ]27p that this does not occur, although it is still possible that a 
phase transition lurks beyond the range of parameters (masses, lattice spacings) that have been 
investigated to date. Analyticity could also fail if there were an essential singularity right at s = 0. 
This is hard to rule out a priori, though it seems unlikely, since we are expanding around a massive 
theory with no obvious IR divergences. At the end of this section, we discuss work in progress 
on deriving rSZPT directly from the lattice theory, using the methods of Sec. 3. If successful, this 
work would (among other advantages) reduce these analyticity concerns. 

To move from tip = 4 to tip = 3, we take one quark mass large. Call it the "charm" quark, 
with mass m c . We first choose m c as large as possible without leaving the region where chiral 
perturbation theory applies. This point is taken, nominally, as m c ~ 2/n!, hys , where m£ ys is the 
physical strange quark mass. For a clean separation of scales, we can temporarily take the other 
three masses much smaller than m£ hys . 

We then integrate out m c from (4,4, \) x . Since this is a perturbative process (order by order in 
rSZPT), there is little doubt that the resulting chiral theory is (3,4, ^) x , just as continuum SU (2)l x 
SU (2)r XVT results from integrating out the strange quark from the SU (3)l x SU (3)r theory [f3C|]. 



Nevertheless, an explicit check of this step is in progress [31]. 

Since, by the previous steps in the argument, (4,4, \) x describes the long-distance physics 
of the rip = 4 theory, (3,4, \) x describes that physics when m c ~ Itrvf 1 ^ . We now assume that 
this decoupling of the charm quark from the long-distance physics remains true (up to the usual 
renormalizations of relevant and marginal operators) as m c increases still further, until m c ^> 1 / a. 
At that point, m c is much larger than all the eigenvalues of the Dirac operator, and charm must 
decouple from the lattice theory, leaving (3,4, \ )lqcd- Under our assumptions, we then have 

(3,l) z = (3,4,i) z . (4.8) 

So rSZPT is the correct chiral theory for three rooted staggered flavors. We can then repeat the 
steps to argue (2, l) x = (2,4, \) x , and (1, \) x = (1,4, \) x . 

These results immediately suggest an apparent paradox, which is seen most clearly in the 
np = 1 case. The theory with one flavor should have only a heavy pseudoscalar, which we call the 
f] 1 , and no light pseudo-Goldstone bosons. Yet the theory with one rooted staggered flavor contains 
light pions, whose masses vanish in the continuum chiral limit, as well as one heavy meson, the 
taste-singlet T7 7 '. There are different weightings of the contributions of such particles in the chiral 
theories of the rooted and unrooted cases, but otherwise rSZPT and uSZPT are similar. 

We first note that the contributions of the unwanted particles must disappear from physical 
correlation functions in the continuum limit, either by decoupling, or by canceling against each 
other, or both. This follows from the argument around Eq. (|Q|): In the continuum limit taste 
symmetry is exact, and the physical sector (defined here as the correlation functions generated by 
taste-singlet sources) is exactly equivalent to a continuum theory of a single flavor. What rSZPT 
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adds to the discussion is that it allows us to see exactly how the decouplings and cancellations take 
place as the continuum limit is approached. 

An rSZPT calculation of the np = 1 taste-singlet scalar-scalar correlator has been performed 
to one loop [[14], There are two kinds of unphysical contributions that appear at a ^ 0. Terms 
coming from the taste- violating hairpins [24] involve taste- vector and taste-axial- vector pions, and 
are proportional to explicit powers of a 2 — they simply vanish in the continuum limit. There are 
also contributions from the physical, taste-singlet hairpins due to the anomaly, as well as from 
connected (non-hairpin) meson correlators. These generate a two-T^ intermediate state with the 
physical weight, but also two-pion states. When we let n R — > 1/4, the latter states have relative 
weights: 1, 4, 6, 4, -15, for the taste-pseudoscalar (Goldstone), axial, tensor, vector, and singlet 
pions, respectively. 17 The existence of the term with negative weight here is a clear indication of 
unitarity violations. At a ^ 0, these pions have different masses due to taste-symmetry violation 
and have a non-zero contribution to the correlation function. The sum of the weights is 0, however, 
so the contributions cancel in the continuum limit when all the pions become degenerate. 

If, as argued, rSZPT is the correct chiral theory for rooted staggered quarks, then there are 
some important consequences. When a— >0, (np,4,nn) x becomes ordinary XPT for 4np ■ hr quark 
species. Therefore, taking hr — > 1/4 order by order produces standard, continuum XPT for np 
flavors in the physical sector. 18 This then implies that the lowest-energy regime of Tif-flavor lattice 
QCD with rooted staggered quarks becomes indistinguishable in structure in the continuum limit 
from ordinary rip -flavor QCD. In this limit, there will therefore be no unitarity or locality violations 
in the physical sector of the chiral theory. Of course, assuming the arguments of Sec. 3 go through, 
this had to work, since the continuum limit of the rooted staggered lattice theory is true QCD. 

Another use of rSZPT is more technical. It has been suggested (e.g., in Ref [32]) that the 
theory of rooted staggered sea quarks and (necessarily unrooted) staggered valence quarks is a 
"mixed" theory, which behaves like it has different lattice actions for the valence and sea sectors. 
If that were the case, it would be rather unpleasant: Among other features, mixed theories have 
different renormalizations of sea and valence quark masses, and even if the quark masses are tuned 
to make the sea-sea and valence-valence mesons degenerate, the valence-sea mesons will be split 
from the others by discretization errors [^3|] . 

In weak-coupling perturbation theory, it is easy to see with the replica trick that the rooted 
staggered theory does not behave like a mixed theory: the renormalizations in the sea and valence 
sectors are the same [Q, g|. Furthermore, rSZPT allows us to argue that the theory is not mixed at 
the chiral level either. The point is that (np,4,^) x is obtained order by order from (np ,4,hr) x . The 
latter theory in turn has lattice symmetries (broken only by mass terms) that interchange valence 
and sea quarks. These symmetries imply that the theory of rooted staggered sea quarks and stag- 
gered valence quarks behaves like a partially quenched theory, not a mixed theory. One can give 
different masses to the sea and valence quarks, as usual in a partially quenched situation. However, 
if one chooses to give valence and sea quarks the same masses, then the symmetries forbid, for 
example, the splitting of valence-sea from valence-valence or sea-sea mesons. 

Finally, we briefly describe our work in progress [34], in which we attempt to derive rSZPT 



17 See Sec. VI of Ref. [Q] for a detailed discussion of how these weights arise in rSCTT. 
18 We are considering 
negative masses, see Refs 



18 We are considering here the case of positive quark masses only. For a discussion of the issues involved with 
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directly from the rooted staggered lattice theory. The immediate problem in such a derivation is that 
a straightforward replica trick fails at the non-perturbative QCD level because we have no control 
over the functional dependence of the theory on hr. In weak-coupling QCD perturbation theory 
the dependence on hr arises only from the counting of sea quark loops, and thus is manifest order 
by order. But once we move beyond perturbation theory, the hr dependence is not known a priori, 
and a unique analytic continuation from integer ur to hr = 1 /4 is not possible. 

This problem with the replica trick at the QCD level is to be contrasted with the trick at the 
chiral level: rSXVT makes sense because the dependence on ur is known when we calculate order 
by order in chiral perturbation theory. The unknown ur dependence of the QCD level is hidden 
in the n R dependence of the LECs of the corresponding chiral theory. Since the goal in the chiral 
theory is simply to calculate physical quantities as functions of the LECs, which are treated as 
independent variables, we can (must!) ignore the hidden hr dependence of the LECs in rSZPT. 

Our tentative solution to this problem is to use the reweighted theory of Sec. 3 as an interme- 
diate step. Since this is a local theory, finding the corresponding chiral theory is straightforward. 
We then need the replica trick only to move from the reweighted theory to the rooted staggered 
case. The latter can be obtained by a convergent Taylor expansion (in the parameter t introduced 



after Eq. ( p.9| )) around the former. This makes it possible to control the ur dependence at the QCD 
level, which should allow us to establish an unambiguous connection between the QCD and chiral 
levels, and thereby to derive rSZPT directly. It should be noted, however, that such a derivation 
would still rely on the existence of PQZPT for standard (local) partially quenched theories. 
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